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SUMMARY 


This  research  examines  the  phenomenon  of  internal  attention  shifting; 
that  is,  paying  attention  to  different  places  in  the  visual  field  without 
changing  the  direction  that  the  eyes  are  pointing.  Recently,  some 
researchers  have  suggested  that  sequences  of  internal  attention  shifts  may 
be  a  necessary  part  of  visual  perception.  This  seems  to  require  that 
attention  be  shifted  very  rapidly,  since  a  complex  scene  can  often  be  seen 
with  only  brief  presentation.  The  purpose  of  the  experiments  in  tin's 
report  was  to  find  out  how  fast  such  attention  shifts  are.  The  results 
indicated  that  it  is  possible  to  shift  attention  from  one  visual  location 
to  another  in  less  than  68  milliseconds,  but  that  vision  continues  to 
improve  for  120-150  milliseconds  after  a  cue  to  shift  attention. 
Furthermore,  it  was  shown  that  attention  increases  the  rate  at  which 
information  is  extracted  from  a  stimulus.  Thus,  internal  attention  shifts 
are  fast  enough  to  be  used  in  normal  visual  perception.  They  may  also  be 
a  component  of  skilled  performance  in  vision-dependent  tasks. 
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HOW  QUICKLY  CAW  ATTENTION  AFFECT  FORM  PERCEPTION? 


I.  INTRODUCTION 

A  variety  of  experiments  Have  shown  that  one  can  attend  to  different 
locations  in  the  visual  field  without  changing  fixation.  Such  a  change  in 
attended  location  will  be  referred  to  as  an  "internal  attention  shift." 
Internal  attention  shifts  result  in  increased  speed  and  accuracy  of 
responses  to  a  stimulus  presented  at  or  near  the  new  attended  location 
(Bashinski  &  Bachrach,  1980;  Posner,  1900).  Both  detection  and  dis¬ 
crimination  performance  are  enhanced  ( Er i < sen  «  Hoffman,  19/2a,  19/Lo, 

1973;  Remington,  1980). 

Are  internal  attention  shifts  a  laboratory  curiosity,  or  a  necessary 
part  of  vision?  Treisman  and  her  colleagues  (Treisman  &  Gelade,  1980; 
Treisman  &  Paterson,  1984;  Treisman  5  Schmidt,  1982)  have  argued  that 
attention  shifts  are  used  to  conjoin  visual  features  into  the  percept  of  a 
unified  object  at  a  specific  location.  Further,  Ullma.i  (1984)  rid s 
suggested  that  attention  shifts  are  essential  for  correctly  perceiving  the 
extent  of  visual  boundaries  and  the  spatial  relationships  among  them.  For 
example,  the  process  of  deciding  whether  or  not  two  points  fall  on  the 
same  boundary  might  require  that  the  boundary  be  "traced"  from  one  point 
to  the  other  using  attention  snifts. 

These  and  other  ideas  about  the  role  of  internal  attention  shifts  in 
constructing  the  visual  percept  share  at  least  one  common  assumption: 
that  changes  in  the  visual  location  being  attended  can  be  made  very 
rapidly.  It  is,  after  all,  possible  to  see  many  of  the  oojects  in  a 
visual  scene,  with  their  features  correctly  conjoined,  after  brief 
tachi stoscopic  presentations  (e.g.,  Biederman,  1972).  This  could  require 
numerous  attention  shifts  during  a  half-second  viewing.  Does  the  focus  of 
attention  really  change  that  quickly? 

Studies  of  internal  attention  shifts  have  shown  that,  under  some 
conditions,  the  effects  of  a  shift  can  be  measured  SO  milliseconds  after 
the  onset  of  a  cue  indicating  trie  location  of  the  target  (Colc-jute, 

Huffman,  &  triksen,  13/3;  Eriksen  &  Hoffman,  l972o,  197a;  Posner,  1980; 
Posner  &  Cohen,  1984).  Bergen  and  Julesz  (1983),  using  a  paradigm  that 
did  not  involve  location  cuing,  also  inferred  an  attention  snift  latency 
of  about  50  mi  1 1  i secorius.  In  other  instances,  however,  effects  did  not 
appear  until  several  hundred  milliseconds  later  (Posner,  1980;  Remington, 
1980;  Remington  A  Pierce,  1984;  Sperling  A  Reeves,  1988). 

In  experiments  that  showed  evidence  for  rapid  attention  effects,  the 
cue  to  shift  attention  was  usually  a  stimulus  located  near  the  target. 

This  kind  of  cue  will  be  referred  to  as  a  "target-area  cue,"  to  distinguisn 
it  from  a  foveal  cue,  which  is  presented  at  fixation.  A  typical  10V'  il 
cue  is  an  arrow  that  points  to  the  target  area.  Studies  using  this  <  u<_ 
are  among  tnose  in  which  attention  effects  were  blower  to  develop. 

Much  of  the  data  relevant  to  attent ion-sn i I  ting  speed  has  ocer 
obtained  in  the  context  of  a  "moving  spotlight'  view  of  jttention. 

Shulman,  kemt  r,.Ht  m.,  and  McLean  (1979)  presented  evidence  that  when 


attention  is  shifted  from  one  visual  location  to  another,  it  noves  in  an 
analog  fashion  over  the  intervening  points  in  the  visual  field,  like  a 
moving  spotlight.  Tsai  (1983)  attempted  to  infer  the  speed  of  this 
hypothetical  analog  attention-shifting  process  using  a  target-area  cue. 

He  arrived  at  an  estimate  of  8  milliseconds  per  degree  of  visual  angle 
traversed.  However  Remington  and  Fierce  (1984),  using  a  foveal  cue, 
claimed  that  the  latency  of  attention  effects  does  not  depend  on  the 
distance  traveled  by  the  shift.  Finally,  Reeves  and  Sperling  (in  press) 
proposed  a  model  of  attention  that  does  not  include  an  analog  shift 
component.  In  their  model,  an  "attention  gate"  opens  up  over  time  at  the 
to-be-attended  location.  In  the  experiments  from  which  this  model  was 
derived,  the  latency  of  attention  snifts  to  a  complex  foveal  cue  was  found 
to  be  relatively  long  (200  to  400  milliseconds). 

At  least  two  factors  complicate  attempts  to  measure  the  latency  of 
attention  effects  on  form  discrimination.  One  is  the  aforementioned 
latency  difference  between  target-area  and  foveal  cues.  Jonides  (1981) 
showed  that  foveal  and  target-area  cues  also  differ  in  the  degree  to  which 
they  automatically  elicit  attention  shifts.  Target-area  cues  do  not 
interfere  with  a  concurrent  memory  task  as  foveal  cues  do,  and  they  affect 
performance  even  when  they  are  not  helpful  and  subjects  are  trying  to 
ignore  them.  All  of  these  results  raise  the  possibility  that  the  rapid 
target-area  cue  effect  is  either  mediated  by  a  very  specialized 
attentional  mechanism  or  is  not  an  attentional  effect  at  all.  For 
example,  target-area  cue  effects  might  be  due  to  lateral  masking,  or  to 
residual  activity  in  peripheral  visual  channels.  On  the  other  hand,  the 
long  latency  of  foveally  cued  attention  shifts  might  simply  be  due  to  the 
time  required  to  interpret  the  cue  and  determine  where  to  direct 
attention.  The  process  of  actually  changing  the  locus  of  attention,  and 
its  subsequent  effects  on  form  discrimination,  might  oe  the  sane  for  uoth 
cue  types. 

The  first  two  of  the  present  experiments  were  directed  in  part  toward 
resolving  this  issue.  The  time  courses  of  attention  effects  resulting 
from  both  kinds  of  cues  were  measured  and  compared.  The  result  was  that 
the  major  differences  in  time  course  due  to  cue  type  were  nearly 
eliminated  by  extensive  practice  with  the  foveal  cue.  The  remaining 
difference  in  the  latency  of  attention  effects  is  probably  due  to  a  small 
difference  in  the  time  required  to  interpret  a  ruveal  arrow  cue  versus  a 
target-area  flash. 

A  second  obstacle  to  de  termini  rig  the  temporal  characteristics  of 
attention  effects  on  form  discrimination  is  that  form  discrimination 
itself  takes  time.  In  the  task  of  detecting  a  luminance  increment,  the 
target  is  typically  presented  for  b  milliseconds  or  less,  whereas  form 
discrimination  usually  requires  tnat  the  target  be  present  for  tens  of 
milliseconds.  This  presents  a  potential  problem.  Should  one  assume  that 
the  time  available  for  attention  shifting  is  the  time  between  the  onset  of 
the  cue  and  the  onset  of  the  target,  or  the  time  between  the  onset  of  the 
cue  and  the  offset  of  the  target?  ->ince  some  of  the  time  that  tne  target 
is  present  couTd  also  be  used  for  attention  shifting,  a  latency  derived 
from  the  typical  practice  of  plot  Ling  performance  as  a  function  of 
cue-target  onset  asynchrony  could  ho  a  a  rinus  underestimate. 


The  second  and  third  experiments  examined  this  issue  by  moisuri  ng  the 
time  course  of  attention  effects  for  varying  target  durations.  The 
results  were  compared  to  predictions  oased  on  the  assumption  that  tne 
latency  of  tne  first  measurable  attention  effects  on  form  discrimination 
is  relatively  long  (greater  than  1 00  milliseconds  including  stimulus 
presentation  time).  These  predictions  were  not  confirmed;  the  data  show 
that  attention  effects  begin  very  quickly  at  the  target  location. 
Furthermore,  it  appears  that  attention  can  increase  the  rate  at  which 
information  is  extracted  from  a  stimulus.  This  latter  result  led  to  the 
development  of  a  model  of  the  temporal  aspects  of  internal  attention 
effects. 


II.  EXPERIMENT  1:  TIME  COURSE  OF  ATTENTION  EFFECTS  FOR  FOVEAL  VS. 

TARGET-AREA  CUES 

In  this  experiment,  the  effects  of  internal  attention  shifts  on  the 
discrimination  of  T -1  ike  target  figures  were  assessed  using  a  paradigm 
that  combines  some  elements  of  experiments  cited  earlier  by  Posner  (1900), 
Remington  (1980),  and  Bashinski  and  Bachrach  (1980).  A  cue  directed 
attention  to  the  location  of  a  target  figure.  The  time  between  cue  onset 
and  target  onset  ( i .  c . ,  the  time  available  to  sni ft  one's  attention  before 
the  target  arrived)  was  varied.  This  time  will  henceforth  oo  called  the 
"cue-target  onset  asynchrony  (CTOA)." 

If  attention  can  indeed  be  moved  to  tne  target  location  during  the 
CTOA,  and  if  it  can  improve  form  discrimination  accuracy,  then  the 
proportion  of  correct  discriminations  should  increase  as  more  time  is 
allowed  to  shift  attention.  Plotting  performance  as  a  function  of  CTOA 
yields  a  time  course  of  attention  effects. 

Of  particular  interest  in  Experiment  1  was  a  comparison  of  the  time 
courses  obtained  wi tn  foveal  cues  and  target-area  cues.  Differences  in 
latency  obtained  with  these  cue  types  are  theoretically  important  for  two 
reasons:  (a)  the  aforementioned  possibility  that  the  target-area  cue 
effect  may  not  be  an  attentional  effect  at  all,  and  (b)  the  notion  tnat 
the  target-area  cue  may  not  be  representati ve  of  how  attention  is  normally 
directed.  During  a  single  fixation  of  a  real  scene,  one's  attention  is 
seldom  driven  by  a  series  of  abruptly  appearing  peripheral  cues,  it  is 
therefore  possible  that  the  foveal  cue  is  more  characteristic  of  the  way 
internal  attention  shifts  arc  normally  initiated.  One  might  then  conclude 
from  the  existing  data  that  foveally  cued  attention  shifts  would  be  too 
slow  to  be  involved  in  the  kind  of  object  perception  operations  posited  by 
Treisman,  Ullman,  and  others.  However,  most  of  the  existing  experiments 
used  detection  tasks  and  reaction  time  measures.  No  experiment  has  used 
form  discrimination  accuracy  to  compare  the  time  course  of  attention 
effects  for  tne  two  cue  types;  so,  it  was  necessary  to  determine  whether  a 
large  latency  difference  exists  under  tnese  conditions. 

Method 

Observers.  Three  observers  ( two  females,  ori<-  male)  with  Dermal 
vision  were  tested.  None  had  participated  in  similar  experiments  before. 


1  w  w  irj  mj  *■; 


vV.V.V.' 


All  were  paid  $7/hour  plus  a  small  Donus  for  correct  responses  independent 
of  condition. 

Apparatus.  Stimuli  were  presented  on  an  Aridek  dOU-A  video  monitor 
controlled  by  an  International  Business  Machines  personal  computer 
1 1 BM- PC)  using  a  non-i  nterlaced  60  riz  frame  rate.  The  phosphor  was  P-31, 
which  decays  to  10'i  of  initial  radiant,  energy  in  40  microseconds.  The 
luminance  of  the  stimuli  was  36.4  cd/n-,  and  they  were  displayed  on  a 
dark  surround.  The  right  eye  of  each  observer  was  monitored  using  a 
standard  video  camera  with  zoom  lens  and  a  separate  display  monitor. 
Adjustable  head  and  chin  rests  were  usi.  ’  to  maintain  head  position  .39.  /  r 
hv:';  the  stimuli. 

'.Stimuli .  Each  of  four  target  stimuli  consi stop  m  a  1-  uj  l-deyr.  • 
plus  sign  with  one  of  its  arms  removed  tc  yield  a  T -1  ike  figure.  The  i;  a  > 
followed  che  oumr  contours  of  a  plus  sign  (see  Figure  1 ) .  The  actual 
stimuli  were  1  iyht-on-dark,  thouyii  their  depictio..  in  the  figure  is 
dark-on-1 i ght. 
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Figure  1.  Sequence  of  i  vents  on  Each  Trial,  Experiment  1. 

Procedure.  Figure  I  shows  the  sequence  of  events  for  each 
experiment'd  T  trial.  First,  a  small  fixation  dot  was  presented  i or  odd 
milliseconds  in  the  center  of  the  display.  The  dot  turn  disappeared,  and 
at  tne  same  time,  a  cue  appeared  to  tell  me  observe-  which  target 
location  would  contain  tne  relevant  target.  In  toe  target-area  cue 
condition,  a  U. S-degree  square  was  presented  either  4  or  7  degrees  to  tm 
right,  left,  above,  or  below  the  fixation  point.  This  square  was  the  cue 
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to  the  location  of  the  relevant  target.  In  the  fovea!  cue  condition,  a 
0.5-degree  arrow  was  presented  at  the  fixation  point.  The  arrow  pointed 
either  left,  right,  up,  or  down.  Both  target-area  and  foveal  cues  were 
displayed  for  one  video  frame  (17  msec).  Ttiere  followed  a  blank  interval 
that  varied  in  duration  between  17  and  283  msec.  The  sum  of  the 
(constant)  cue  duration  and  the  blank-interval  duration  is  the  CfOA.  On 
each  trial,  one  of  10  CTOAs  was  randomly  selected  (34,  50,  67,  84,  117, 
134,  167,  200,  267,  or  300  msec). 

Following  tne  blank  interval,  four  target  stimuli  v/ere  presented,  to 
the  right,  left,  above,  and  below  fixation.  All  targets  were  presented 
either  3  or  o  degrees  from  the  fixation  point.  The  target  to  be  presented 
at  each  location  on  a  given  trial  was  randomly  selected,  with  replacement 
from  tne  set  of  four  T-like  figures  descrioed  above.  Stimuli  remaineo 
illuminated  for  either  50  or  84  milliseconds  and  then  were  masked.  The 
masks  remained  in  place  until  tne  observer  had  decided  which  stimulus  nad 
been  presented  in  the  cued  location  and  had  indicated  his  or  her  decision 
by  depressing  one  of  the  four  arrow  keys  on  the  IBM-PC  numeric  keyboard. 
When  a  response  was  made,  the  observer  was  presented  with  feedback  as  to 
its  correctness,  and  the  next  trial  was  initiated. 

Observers  were  instructed  to  maintain  fixation  in  the  center  of  the 
display.  They  were  informed  when  significant  eye  movements  occurred, 
although  stimulus  presentation  was  too  rapid  to  allow  a  saccade  to  the 
target.  Thus,  eye  movements  could  only  diminish  performance  through 
saccadic  suppression. 

The  experiment  consisted  of  eight  1-hour  sessions.  Each  session 
consisted  of  five  blocks  of  100  trials  each.  Within  a  block,  the  same 
target  eccentricity,  target  duration,  and  type  of  cue  were  used.  The 
order  of  presentation  of  the  resulting  eight  conditions  was 
counterbalanced. 

Pretraining.  Prior  to  their  participation  in  the  experiments,  all 
observers  were  pretrained  on  foveal  identification  of  the  four  possible 
targets.  On  each  pretraining  trial,  one  of  the  four  possible  targets  was 
presented  foveal ly  for  a  duration  which  varied  according  to  tne  observer's 
performance  on  the  previous  trial.  If  the  previous  trial  was  correct,  a 
counter  was  decremented.  When  the  decrement  totaled  17,  tne  stimulus 
duration  was  reduced  by  one  video  frame  (17  msec).  If  the  preceding  trial 
was  incorrect,  the  counter  was  incremented.  The  size  of  the  increment  and 
decrement  was  controlled  so  that  overall  proportion  correct  would  be  as 
close  to  0.625  as  possible.  Tnis  proportion  represents  50  correct 
di scrimi nati on  when  corrected  for  guessing. 

Immediately  following  the  presentation  of  the  target,  the  mask  was 
presented,  and  the  observer  decided  which  target  had  appeared.  Following 
the  observer's  response  on  the  computer  keyboard,  the  value  ot  the  counter 
representing  stimulus  duration  was  displayed.  Observers  were  told  that 
this  number  would  get  smaller  if  their  answer  was  correct  and  larger  if  it 
was  incorrect,  and  that  they  should  try  to  obtain  as  small  a  number  as 
possible.  Observers  were  pretrained  on  1,500  target  identification  trials 
before  starting  the  experiments.  Previous  data  indicated  that  t.nis  was 
sufficient  practice  for  performance  to  asymptote. 
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The  critical  aspect  of  tin-  data  nl  jure  <’  is  tne  effect  of  cur  lyp" 
on  performance.  Fur  the  target-ai  'a  cu  effects  of  attention  were 
present  after  a  CTOA  of  only  5el  mi  11  i s<  ••  os,  whereas  for  ttie  fovea)  •_ ue s , 
effects  were  much  slower  to  develop.  mn  r  e  was  no  clear  effect  of 
increasing  CTOA  in  me  fovea!  cm  con  *  on  before  about  lbl)  msec. 
Consequently,  average  proportion  curr-  was  significantly  nigher  for  tne 
target-area  cues  than  for  tin  +oveal  c  (  .47  vs.  .38,  x^i.1 ]  =  30.8, 
p  •  .001).  Thus,  the  results  are  cor  mt  with  those  obtained  for 

luminance  increment  detection,  in  that  me  latency  of  attention  effects 
appears  to  lx;  much  greater  with  the  •  -t-  rma  cues  than  with  the  fov-al 

cues.  In  addition,  the  data  show  tb  . :  .  >r  me  target-area  cues, 
performance  was  near  ivnijf.ne  if  te»  it  0  milliseconds,  whereas  f  or 

tiie  foveal  cues,  a  i  lear  asympt ot.--  of  r<.-aciu.>«J  within  the  ran /„  of  the 

CTOAs  sampled. 

It  is  evident  from  this  ex  per  t  t  the  time  courses  of  attention 

effects  differ  for  the  two  types  ms.  It  is  possible  that  the  actual 

operation  of  shifting  attention  :  ...*  -  •  for  bo t n  cue  types;  however, 

determining  where  to  attorn  tat'-  gc  .  th  the  foveal  cue.  This  could 

occur  because  the  relati  ip  *n  foveal  cue  arid  the  exact  visual 

location  to  be  attended  •  b  •  no  >  -d  thus  the  to-bc-attondr. d 

location  must  be  rntrieo  ro  ry  act)  trial.  One  way  to  assess 

tnis  possibility  is  to  e •  :  ne  tin  irse  of  attention  shifts  after 

large  amounts  of  practii  ■ t1  fo.  . uos.  if  shifts  that  are  cued 

foveally  are  slower  only  an  ■  •  ast  be  interpreted,  then 

automating  tnis  process  up  -t  >  >hould  result  in  much  faster 

foveally  cued  shifts. 

ill.  EXPERIMENT  2:  EFt  CUE  PRACTICE  MU  TARGET  UURATiOd 

This  experiment  was  de  .etennine  whether  extensive  practice 

markedly  alters  the  time  coe>  ovoally  cueo  attention  shifts,  as 

would  be  expected  under  tine  hyp....esis  that  tne  differences  found  in 
Experiment  1  are  due  to  differences  in  toe  speed  of  interpreting  the  cue. 
If  tne  latency  and  magnitude  of  foveally  cued  shifts  can  he  brought  into 
the  range  of  those  found  with  tne  peri  liberal  cue,  turn  it  'would  be  clear 
that  fast  cuing  effects  are  possible  under  conditions  that  rule  out 
lateral  masking  or  other  target  area  interactions. 

To  the  extent  that  large  practice  effects  occur,  their  generality 
becomes  an  issue.  Tnerofore,  practice  was  given  in  only  two  of  the  four 
possible  directions  and  one  of  the  two  possible  eccentrici ties.  Potential 
effects  of  practice  on  identifying  the  target  were  minimized  ny  tin-  large 
number  of  target  idonti  fi  cation  pre-training  trials  given  prior  to 
Experiment  1. 

Ketnod 

This  experiment  was  run  immediate iy  after  Experiment  1,  using  the 
same  observers,  equipment,  and  procedure.  Before  the  data  were  collected, 
observers  were  given  7, 8 (Jo  prac  ti  •  .*.  trials  using  only  the  tov'-al  cue. 
During  the  prac  tie0  pham  ,  *  argots  inain  oe-.ur  in  only  two  positions 


(above  or  below  the  fixation  point)  anil  at  only  the  6-degree  eccentricity. 
A  target  duration  of  84  msec  was  used  for  the  first  4,01)0  trials.  For  the 
final  3,500  trials,  the  target  duration  was  50  milliseconds. 


During  practice,  tne  CT0A  was  varied  on  a  trial -by-trial  basis  using 
the  same  adaptive  procedure  that  was  used  for  varying  stimulus  duration 
during  the  pretraining  phase  of  Experiment  1.  Again,  a  counter  that 
represented  CTOA  was  decremented  after  a  correct  response  and  incremented 
after  an  incorrect  response,  and  observers  attempted  to  minimize  the 
counter  value.  The  initial  value  of  tne  counter  was  200,  which 
represented  a  CTOA  of  234  milliseconds. 

Results  and  Discussion 

As  in  Experiment  1,  discrimination  performance  increased  significantly 
with  increasing  CTOAs  for  both  foveal  and  target-area  cues  ( target-area : 
x2[9]  =  56. 6,  p  \  .001  ;  foveal:  x^LOj  =  27.7,  p<.005).  However, 
extensive  practice  substantially  changed  tne  differences  found  in 
Experiment  1  between  foveal  and  target-area  cues.  In  fact,  practice 
completely  eliminated  the  effect  of  eue  type  on  average  proportion  correct 
( target-area--.  65,  foveal--. 64,  x^l.  =  .3a,  p  '.25).  Figure  3  shows 
the  relationship  Detween  CTOA  lehgtn  and  performance  for  both  cue  types  in 
both  experiments.  In  Experiment  1,  the  CTOA  at  which  a  particular 
proportion  correct  was  readied  was  roughly  1 50  milliseconds  longer  for 
foveal  cues  than  for  target-area  cues.  In  Experiment  2,  this  difference 
was  25  to  50  milliseconds.  Some  difference  between  the  two  conditions 
would  be  expected  even  after  practice  because  use  of  tne  foveal  cue  would 
still  require  the  retrieval  of  the  to-be-attended  location  from  memory. 

But  there  is  no  longer  much  reason  to  suspect  that  the  time  courses 
associated  with  the  two  cues  reflect  totally  different  phenomena. 

As  in  Experiment  1,  Mere  was  no  significant  effect  of  target 
eccentricity  for  either  cue  type  ( target-area :  x^Llj  =  2.86,  p  .05; 
foveal:  x'-[l]  =  1.68,  p  .10).  However,  tne  stimulus  duration  effect 
wis  significant  for  both  eue  types  ( target-area :  x^Ll j  -  31.8, 

P''  .001;  foveal:  x--  i_lj  =  11.8,  p  .001).  This  finding  is  discussed 
further  below. 

AnoMor  issue  was  the  generality  of  the  observed  practice  effects. 

Tne  overall  proportion  of  correct  responses  improved  yreatly  with  practice 
for  both  cue  types  ( target-area :  x^il j  =  u7.8,  p  .001;  foveal: 
x^LlJ  -  123.0,  fj  ■'  .001).  There  was  also  significant  improvement 
fp  .01)  for  alT  target  directions,  durations,  ana  eccentricities,  hone 
of  tiie  interactions  between  these  v^riaoles  and  practice  was  significant. 
There  was,  however,  a  significant  interaction  between  practice  and  cue 
type  ( l I J  -  o.j4,  p  '  .001).  because  performance  improvements  with 
practice  were  not  confined  to  the  particular  snift  direction, 
eccentricity,  or  cue  type  that  wjs  practiced,  practice  must  nave  improved 
ei  tiler  the  speed  of  smiting  at  ton*  ion  itself  or,  more  likely,  extraneous 
aspects  of  the  task.  However,  tm  >i gni t lea  it  interaction  between  cue 
typo  and  practice  indicates  Mat  there  was  a  component  of  practice  etteits 
that  was  specific  to  cue  type.  I n i s  result  would  oe  expocced  if  one  oi 
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Figure  3.  Proportion  Correct  as  a  Function  of  CTOA  for  Both 
Foveal  and  Target-Area  Cues,  Experiments  1  and  2. 

the  effects  of  practice  was  to  strengthen  the  association  between  the 
foveal  cue  and  the  to-be-attended  location. 

Given  the  main  results  of  this  experiment,  it  seems  reasonable  to 
assume  that  both  foveal  and  target-area  cues  can  initiate  a  shift  of 
internal  attention  to  a  new  target  location.  We  now  return  to  the 
original  question  that  motivated  the  research:  How  quickly  can  such  an 
attention  shift  begin  to  affect  form  discrimination? 

Suppose  that  the  data  from  the  target-area  cue  condition  of 
Experiment  2  reflect  the  time  course  of  a  change  in  internal  attention 
locus.  Based  on  these  data,  as  shown  in  Figure  3,  it  is  clear  that  an 
improvement  in  performance  occurs  between  34  and  50  milliseconds.  This 
could  be  interpreted  to  mean  that  the  latency  of  attention  effects  is  less 
than  50  milliseconds.  Considering  that  it  must  take  some  time  to  detect 
and  localize  the  cue,  these  data  might  be  thought  to  imply  that  changing 
attentional  locus  is  a  very  rapid  process;  but  there  is  a  problem  with 
such  an  interpretation. 

The  problem  may  be  illustrated  by  considering  a  model  in  which 
attention  acts  like  a  "shutter"  that  does  not  open  until  100  milliseconds 
after  the  onset  of  the  cue.  Suppose  that  the  duration  of  t he  target  is  oO 
milliseconds.  If  the  CTOA  is  34  milliseconds,  then  the  total  time  from 
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the  onset  of  the  cue  to  the  onset  ■  > f  tne  mask  is  84  milliseconds.  Thus, 
the  shutter  will  not  have  opened  when  the  mask  appears,  and  performance 
will  be  at  baseline.  With  a  CTQA  of  67  milliseconds,  however,  the  CTOA 
plus  target  duration  is  117  msec;  so,  the  shutter  will  have  been  open  for 
17  milliseconds  before  the  mask  appears.  A  CTOA  of  84  milliseconds  means 
that  the  target  would  be  in  view  for  34  milliseconds  uefore  being  masked, 
and  so  on.  Under  these  conditions,  performance  would  asymptote  at  a  CTOA 
of  100  milliseconds,  since  tne  shutter  would  open  in  tine  to  reveal  the 
target  for  its  entire  duration.  This  is,  in  fact,  approximately  the 
point  at  wnich  the  target-area  data  for  Experiment  3  reached  asymptote. 

This  simple  model  shows  that  increases  in  performance  with  small 
CTOAs  do  not  necessarily  imply  very  rapid  attention  shifts.  The  shutter 
model  is  merely  an  illustrative  device,  but  the  sane  considerations  would 
hold  for  a  model  in  which  100  msec  or  more  were  required  for  the  focus  of 
attention  to  shift  across  visual  space,  arrive  at  the  target,  and  thus 
begin  to  affect  performance.  Notning  in  the  data  as  analyzed  so  far  rules 
out  this  possibility.  However,  the  shutter  model  implies  that  varying  the 
duration  of  the  target  should  markedly  affect  the  observed  time  course  of 
attention  effects.  Specifically,  as  target  duration  is  reduced,  longer 
CTOAs  are  required  before  the  effects  of  attention  begin  to  appear. 
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Figure  4  shows  hypothetical  data  that  illustrate  this  effect,  again 
assuming  that  attention  operates  like  a  shutter  that  opens  100  msec  after 
the  cue.  The  two  curves  in  the  figure  were  computed  for  the  two  target 
durations  (50  msec  and  84  msec)  of  Experiments  1  and  2,  using  the  equation 

Proportion  Correct  =  ((TO  +  CTOA)  -  100)  *  .0045  +  .35  (1) 

where  TD  denotes  target  duration,  and  the  quantity  ( ( TD+CTOA ) -100)  has  a 
maximum  value  of  TO  and  a  minimum  value  of  zero.  The  relative  height  of 
each  point  depends  on  the  length  of  time  that  the  target  remains  visible 
after  the  hypothetical  shutter  opens.  As  the  figure  shows,  varying  the 
duration  of  the  target  does  not  affect  the  point  at  wnich  the  curve 
asymptotes,  but  does  affect  the  point  at  which  it  starts  to  rise.  After 
the  curves  begin  to  rise,  they  are  parallel. 
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Figure  5.  Proportion  Correct  as  a  Function  of  CTOA  for  50  and  84 
msec  Target  Durations.  Data  are  from  the  target-area 
cue  condition  of  Experiment  2. 

Figure  5  shows  the  target-area  cue  data  from  Experiment  2  plotted 
separately  for  each  of  the  two  target  durations.  Some  aspects  of  these 
results  do  not  match  the  hypotnetical  shutter  model  predictions  of  Figure 
4.  First,  the  experimental  curves  both  appear  to  rise  at  the  same  point; 
there  is  no  indication  that  the  50-msec  curve  rises  later.  Second,  the 
curves  do  not  appear  to  asymptote  at  the  same  CTOA;  the  84-msec  curve 
appears  to  asymptote  earlier.  Third,  the  84-nsec  curve  appears  to  rise 
more  steeply. 
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To  summarize  the  foregoing  analysis:  The  fact  that  attention  effects 
appear  at  short  CTOAs  does  not  rule  out  models  in  which  attention  has  a 
long  latency,  since  attention  may  shift  during  target  presentation. 
However,  when  tne  data  for  two  different  target  durations  were  compared, 
some  predictions  of  a  long-latency  model  (i.e.,  effects  beginning  at 
different  CTOAs  and  reaching  asymptote  at  the  same  CTOA)  were  not 
confirmed.  Because  these  results  are  critical  for  making  distinctions 
between  long-  and  short-latency  models,  it  was  necessary  to  attempt  to 
replicate  them  in  a  larger  experiment. 


IV.  EXPERIMENT  3:  EFFECT  OF  VARYING  TARGET  DURATION: 

A  TEST  OF  LONG-LATENCY  MODELS 

In  order  to  reconcile  long  latencies  for  attention  shifts  with  the 
observed  increase  in  discrimination  performance  at  small  CTOAs,  it  must  be 
assumed  that  attention  continues  to  shift  to  the  target  location  while  the 
target  is  being  presented.  Thus,  if  the  target  duration  is  reduced,  a 
correspondingly  larger  CTOA  should  be  required  to  produce  a  rise  in 
discrimination  performance.  Short-latency  models,  However,  would  predict 
that  performance  rises  at  small  CTOAs  regardless  of  the  target  duration. 

In  this  experiment,  a  34-msec  target-duration  condition  was 
investigated,  along  with  the  50-msec  and  84-msec  conditions  used  in 
Experiments  1  and  2.  Smaller  CTOAs  were  used,  CTOAs  were  sampled  more 
densely,  and  the  number  of  experimental  trials  per  observer  was  increased 
from  2,000  to  7,000. 

Method 

Observers.  Three  female  observers  with  normal  or  corrected-to-normal 
v'sion  were  each  tested  for  11  sessions  of  approximately  1  hour  each. 

None  of  the  observers  had  participated  in  Experiments  1  and  2.  Two  of 
them,  however,  had  participated  in  similar  experiments.  They  were  paid 
$7/hour  plus  a  small  bonus  for  correct  responses  independent  of 
condition.  The  third  observer  was  a  staff  member  who  received  no 
compensation  beyond  her  regular  salary. 

Apparatus.  Stimuli  were  presented  on  an  IBM  Enhanced  Color  Monitor 
and  controTTed  by  an  I Bfl- PC/XT  containing  an  Enhanced  Graphics  Adapter. 
Decay  times  for  the  phosphors  (P-22-B,  P-22-G,  and  P-22-R)  were  less  than 
1  msec.  Stimuli  were  presented  on  a  dark  background.  Their  luminance  was 
13.7  cd/m^.  As  in  Experiments  1  and  2,  eye  position  was  monitored  using 
a  video  camera  with  zoom  lens,  and  a  separate  display  monitor.  Adjustaole 
head  and  chin  rests  were  used  to  maintain  head  position. 

Stimul i .  Stimuli  and  masks  were  the  same  as  those  used  in 
Experiments  1  and  2;  however,  the  pixel  density  was  increased  from  (>4  to 
112  pixels  per  character. 

Procedure.  The  experimental  procedure  was  the  sane  as  in  Experiments 
1  and  2,  with  the  following  exceptions:  (a)  Tne  fixation  pumt  remained 
illuminated  during  the  entire  experimental  trial,  (b)  The  experiment  used 
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13  CTOAs  (16.7  to  217  msec  in  16.7-msec  steps),  three  target  durations 
(34,  50,  and  84  msec),  one  eccentricity  (6  degrees),  and  one  cue  type 
( target-area) .  (c)  Seven  experimental  sessions  and  one  practice  session 

of  1,000  trials  each  were  run.  (d)  A  full-attention  target  discrimination 
session  was  run  once  prior  to  the  practice  and  experimental  sessions,  and 
twice  after  their  conclusion.  The  purpose  of  these  sessions  was  to  obtain 
an  estimate  of  the  rate  at  which  target  discrimination  performance  improves 
with  presentation  duration,  under  optimal  (foveal,  full-attention) 
conditions,  for  the  target  set  used  in  the  experiment. 

In  the  ful 1 -attenti on  sessions,  a  fixation  point  was  presented  for 
668  msec  at  the  center  of  the  display.  It  was  then  replaced  by  one  of  the 
four  T-like  target  figures,  which  remained  on  the  screen  for  a  randomly 
selected  duration  before  being  replaced  by  the  mask.  The  observer  then 
indicated  with  a  keypress  which  one  of  the  four  targets  had  appeared.  The 
six  possible  target  durations  ranged  from  16.7  to  100  milliseconds  in 
16.7-millisecond  intervals. 

Results  and  Discussion 
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Figure  6.  Proportion  of  Correct  Discriminations  as  a  Function 
of  CTOA  for  Target  Durations  of  34,  50,  and  84  msec, 
Experiment  3. 

Figure  6  shows  discrimination  performance  in  the  experimental  sessions 
as  a  function  of  CTOA  for  all  three  target  durations.  Several  facts  are 
evident  from  this  figure,  in  addition  to  the  expected  significant  effect 
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of  target  duration  on  average  performance  (x2[?.j  =  fa 11.2,  p  •  .001). 

First,  for  each  target  duration,  there  was  a  significant  increase  in 
performance  between  CTOAs  of  17  and  34  msec  (84-msec  TD:  x?L  1 J  =  16.  fa, 
p  <  .001;  50-msec  TD:  x2[lj  =  9.8,  p  <'  .U05;  34-msec  TD:  x - L 1  j  = 

7.7,  p  .01).  This  is  clear  evidence  against  a  long-latency  model,  since 
such  a  model  would  predict  that  the  shorter  the  target  duration,  the 
longer  the  CTOA  required  tu  show  an  initial  rise  in  performance. 

Moreover,  the  fast-rising  parts  of  :i.e  curves  were  not  parallel  (60-84 
msec:  x2[3J  =  35.1;  84-50  msec:  x2L3j  --  21.5;  84-34  msec:  x2l3]  = 

27.5;  all  p  v  .001).  As  target  duration  increased,  the  steeper  the  slope 
of  the  curve  became,  and  the  earlier  the  asymptote  was  reached. 

Thus,  all  of  the  results  suggested  by  the  target-duration  comparisons 
in  Experiment  2  were  observed  in  this  experiment,  including  tuose  that 
argue  against  a  long-latency  model.  In  particular,  the  significant 
performance  improvement  observed  between  CTOAs  of  17  and  34  msec  for  a 
34-msec  target  duration  indicates  that  attention  is  beginning  to  affect 
performance  within  68  nsec  of  cue  onset. 

The  results  also  indicate  how  long  attention  effects  last.  Curves 
for  longer  target  durations  asymptote  at  correspondingly  shorter  CTOAs; 
that  is,  all  curves  asymptote  at  about  the  same  total  time  since  cue 
onset.  This  implies  that  attention  effects  continue  "to*  bui  Id  up  during 
target  presentation  at  about  the  sane  rate  as  they  do  during  the  cue- 
target  interval.  An  estimate  of  the  duration  of  this  buildup  can  be 
obtained  by  plotting  average  performance  as  a  function  of  CTOA  plus  target 
duration  (Figure  7).  The  figure  shows  that  attention  effects  cease 
120-150  msec  after  cue  onset. 


Finally,  the  results  suggest  that  attention  moderates  the 
relationship  between  target  duration  and  discrimination  performance. 

Figure  8  shows  proportion  correct  as  a  function  of  target  duration,  with 
CTOA  as  a  parameter.  Only  data  from  the  first  six  CTOAs  (the  ones  that 
account  for  virtually  all  of  the  ooserved  attention  effects)  are  shown. 

As  CTOA  increases,  there  is  a  systematic  increase  in  tiie  slope  of  tne 
initial  segment  of  the  lines,  the  part  that  represents  an  increase  in 
target  duration  from  34  to  60  msec.  (Tne  84-msec  target  duration  is  not 
included  in  the  slope  because  performance  is  at  asymptote  for  CTOAs  over 
50  msec.)  For  example,  the  slope  obtained  with  a  cTOA  of  lou  nsec  (0,017) 
is  nearly  triple  that  observed  with  a  CTOA  of  1/  msec  (0.00b).  This 
effect  was  highly  significant  in  the  overall  data  (x2[6j  =  u 7 . 1  , 

.001)  and  for  each  of  the  three  observers  (observer  1:  x2L‘jJ  = 

29.9,  p  <'  .001;  observer  2:  x2(_faj  =  19.3,  p  .006;  observer  6: 
x2[5j  =  34.0,  p'  .001).  A  possible  interpretation  is  that  focusing 
spatial  attention  on  the  target  location  increases  the  rate  at  which 
information  is  extracted  from  the  target. 

Tnis  interpretation  is  further  supported  by  tne  observation  that  the 
slope  increase  is  not  simply  a  consequence  of  better  asymptotic  performance 
at  longer  CTOAs.  Examination  of  the  top  three  lines  of  Figure  8  shows 
that  even  though  asymptotic  performance  is  similar  in  all  three,  it  is 
reached  sooner  for  the  lines  representing  longer  CTOAs.  lienee,  if  the 
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Figure  7.  Proportion  of  Correct  Discriminations  as  a  Function  of 
Total  Time  Available  to  Shift  Attention  (CTOA  Plus 
Target  Duration),  Experiment  3,  Points  on  the  curve 
are  averages  over  all  three  target  durations  (34,  50, 
and  04  msec),  except  for  the  first  point,  which 
necessarily  contains  only  data  from  the  34-msec  target 
duration,  a no  the  second  point,  which  contains  only 
data  from  the  34-msec  and  50-msec  target  durations. 


effect  of  increasing  target  duration  is  to  allow  more  information  to  be 
extracted  from  the  target,  then  the  effect  of  attending  internally  to  the 
target  is  to  speed  up  this  extraction  process. 


If  this  interpretation  is  correct,  then  the  ful 1 -attenti on  condition 
that  was  run  before  and  after  the  experimental  trials  should  show  a 
target-durati on  effect  that  is  at  least  as  steep  as  that  obtained  for  the 
100-msec  CTOA  trials  in  Figure  0.  Figure  9  shows  performance  in  the  three 
full -attenti on  condition  sessions.  There  is  a  large  pre/post  effect  on 
the  slope  of  these  curves  ( x2 L 3 J  =  87.5,  p  .001).  Nevertheless,  it  is 
clear  from  the  post-experiment  data  that  a  very  steep  target-durati on 
effect  is  observed,  lust  as  it  was  in  the  long-CTOA  data  of  Figure  8, 
since  in  both  cases  utU  oti"’  we,  focused  on  the  target  area  when  the 
target  appeared. 
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Figure  8. 


Proportion  of  Correct  Discriminations  as  a  Function 
of  Target  Duration,  with  CTOA  as  a  Parameter.  This 
is  a  replotting  of  the  data  from  the  fast- rising 
parts  of  the  curves  in  Figure  6.  It  shows  that  the 
initial  slope  of  the  target-duration  effect 
increases  with  CTOA. 


To  summarize,  the  results  of  Experiment  3  indicate  that:  (a)  Effects 
of  attention  begin  at  the  target  location  in  less  than  63  milliseconds, 
perhaps  mucn  less,  (b)  Attention  effects  continue  until  130-150  msec 
after  the  presentation  of  the  cue,  regardless  of  target  duration;  they  are 
not  terminated  by  the  presentation  of  the  target,  (c)  Attention  increases 
the  rate  at  which  information  is  extracted  from  the  target. 


In  order  to  obtain  more  precise  estimates  of  tiie  latency  and  duration 
of  attention  effects,  the  results  were  formalized  as  a  quantitative 
model.  Let  L  denote  the  number  of  milliseconds  after  cue  onset  at  wliicn 
attention  effects  begin,  and  M  denote  the  time  at  which  they  end;  tnen, 
the  time  during  which  attention  is  having  an  effect  on  the  information  in 
the  cued  location  is: 


Attention  Duration  (AD)  =  min  (  M,  (TD  +  CTOA))  -  L. 


If  attention  increases  the  rate  of  information  extraction  from  the 
target,  then  performance  should  be  a  multiplicative  function  of  attention 
duration  and  target  duration;  that  is: 
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Figure  9.  Proportion  Correct  as  a  Function  of  Target  Duration  in 
the  Full-Attention  Condition,  Experiment  3. 


Proportion  Correct  =  AD  *  TD  *  B  +  C  (3) 

where  B  and  C  are  slope  and  intercept  constants. 

This  four-parameter  model  fits  the  data  from  the  34-msec  and  50-msec 
target  durations  quite  well.  When  the  model  was  fit  to  the  first  eight 
points  of  the  34-msec  and  50-msec  data  (to  avoid  including  the  down¬ 
drifting  presumably  caused  by  occasional  eye  movements),  the  resulting 
parameter  values  were  l  =  22,  M  =  122,  b  =  .000119,  and  c  =  .18.  Thus, 
the  best  model  fit  was  obtained  with  an  attention  latency  estimate  of  22 
msec.  The  model  closely  reproduces  the  systematic  increase  in  the  slope 
of  the  target-duration  effect  with  increasing  CTOAs  (Figure  8)  and  the 
cessation  of  attention  effects  after  about  120  milliseconds  (Figure  7). 

In  order  to  account  for  the  data  from  the  84-msec  target  duration 
condition,  one  change  in  the  model  was  required.  Data  from  the  full- 
attention  condition  (Figure  5)  indicated  that  discrimination  performance 
asymptotes  at  a  target  duration  of  between  50  and  67  milliseconds. 
Therefore,  the  actual  target  duration  of  84  msec  was  replaced  in  Equations 
2  and  3  with  a  new  parameter  representing  asymptotic  target  duration. 
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The  resulting  model  was  fit  to  the  data  for  all  three  target 
durations.  The  values  given  above  for  the  original  four  parameters  were 
fixed;  only  the  new  asymptotic  target-duration  parameter  was  allowed  to 
vary.  The  value  of  this  parameter  was  thus  estimated  to  be  62,  which  is 
within  the  range  indicated  by  the  data  from  the  ful 1 -attention  condition. 
Figure  10  shows  the  fit  of  this  model  to  the  data  from  all  target 
durations.  It  is  clear  that  the  parameter  values  that  fit  the  data  from 
the  shorter  durations  also  fit  the  84-msec  data  quite  well,  once  tne 
asymptote  in  target-duration  effects  is  taken  into  account. 
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Figure  10.  Compari son  of  Predicted  and  Actual  Proportion  of 
Correct  Responses  for  Target  Durations  of  34,  60, 
and  84  msec,  Experiment  3.  Solid  Lines  are  trie 
predictions  obtained  via  Equations  2  and  3  (see 
text) . 


V.  GENERAL  DISCUSSION 

The  purpose  of  these  experiments  was  to  determine  how  quickly  shifts 
of  internal  visual  attention  can  affect  form  perception.  Previous 
research  in  which  a  brief  change  in  luminance  was  used  as  a  target  had 
shown  that  when  attention  was  cued  by  the  onset  of  a  stimulus  in  the 
target  area,  attention  effects  were  evident  much  sooner  after  cue  presen¬ 
tation  than  was  the  case  when  cues  presented  at  fixation  were  used  (Posner, 
1980;  Posner  &  Cohen,  1984;  Remington  1980;  Remington  &  Pierce,  1984). 


Experiment  1  replicated  this  effect  for  a  form  discrimination  task, 
and  Experiment  2  showed  that  the  difference  in  attentional  latency  between 
foveal  and  target-area  cues  could  be  largely  eliminated  with  practice. 

This  latter  result  argues  against  certain  non-attentional  explanations  of 
tiie  target-area  cue  effect  (such  as  masking).  Experiment  2  also  uncovered 
several  differences  between  the  time  course  of  attention  effects  for 
different  target  durations.  These  differences  were  observed  again  in 
Experiment  3,  where  it  was  also  shown  that  (a)  the  latency  of  attention 
effects  is  less  than  68  msec,  (b)  attention  effects  continue  to  build  up 
during  the  presentation  of  the  target  but  cease  at  120-150  msec  after  the 
cue,  and  (c)  attention  increases  the  rate  at  which  target  presentation 
time  improves  di  scrimi  r.ati  on  performance.  A  quantitative  model  incor¬ 
porating  these  findings  was  successful  in  accounting  for  the  observed 
effects  of  CTOA  and  target  duration. 

Attenti on  Latency  and  Percepti on 

These  results  have  important  implications  for  the  general  question 
posed  in  the  introduction;  namely,  are  internal  shifts  of  attentional 
focus  fast  enough  to  be  involved  in  constructing  the  visual  percept? 

Under  optimal  conditions  in  these  experiments,  the  latency  of 
attention  effects  was  estimated  to  be  under  60  msec,  even  considering  the 
possibility  that  attention  could  shift  during  the  presentation  of  the 
target.  Studies  of  attention  effects  on  detection  of  luminance  increments 
using  reaction  time  to  target-area  cues  as  the  dependent  variable  have 
also  reported  the  existence  of  attention  effects  within  50  Milliseconds  of 
cue  onset  (Posner,  1980).  Since  this  time  presumably  includes  the  time  to 
process  the  cue  to  shift  attention,  and  perhaps  other  operations  as  well, 
the  actual  time  required  for  the  focus  of  attention  to  change  is  clearly 
fast  enough  to  allow  a  serial  process  to  select  many  different  processing 
locations  during  a  single  fixation. 

However,  there  remain  at  least  two  possible  objections  to  the  idea 
that  rapid  internal  attention  shi fts  underlie  some  aspects  of  perceptual 
processing.  One  is  that  even  if  a  new  attended  location  can  be  selected 
ir,  a  few  milliseconds,  perceptual  processing  at  the  new  location  will 
surely  take  much  more  time.  If  such  processing  requires  attention,  then 
fewer  shifts  of  processing  focus  will  be  possible  witnin  a  typical 
fixation.  In  the  present  experiments,  for  example,  attention  continued  to 
affect  performance  for  at  least  120  milliseconds.  Thus,  for  the  present 
task,  at  most  only  two  or  three  discriminations  requiring  full  attention 
could  be  performed  during  a  fixation.  Moreover,  it  might  be  possible  to 
lengthen  the  time  for  which  attention  will  affect  processing  (M-L)  oy 
increasing  the  amount  of  information  that  has  to  be  extracted  from  the 
target. 

The  operations  for  which  Treisman  and  Gelade  (198U)  and  Oilman  (1984) 
suggested  a  serial  attention  mechani sm--operations  such  as  feature 
conjunction  or  boundary  tracing--may  not  require  anything  like  a  full 
accumulation  of  attention  effects.  Moreover,  one  can  easily  imagine  a 
system  in  which  computations  arc  started  in  serial  fashion  at  various 
index  points,  but  the  selection  of  each  new  index  point  docs  not  depend  on 
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the  completion  of  the  computation  started  at  the  preceding  point.  In  such 
a  system,  the  important  parameter  would  indeed  be  the  latency  of  the 
indexing  operation. 


A  second  possible  objection  rests  solely  on  observer  introspection. 

In  experiments  such  as  those  reported  here,  the  observer  has  the  distinct 
impression  that  at  least  small  amounts  of  time  and  effort  are  being  used 
in  focusing  attention  on  the  cued  region.  Yet,  elsewhere,  the  perception 
of  complex  scenes--which  must  result  from  many,  perhaps  dozens  of  succes¬ 
sive  attention  shifts--seems  effortless.  Of  course  there  are  many 
possible  answers  to  an  objection  like  this,  but  one  such  answer  is  that 
the  feeling  of  conscious  effort  is  tied  to  the  accumulation  of  attention 
effects.  It  may  be  that  merely  changing  the  locus  of  processing  is  not 
effortful;  and  if  perceptual  operations  such  as  boundary  tracing  or  feature 
conjunction  are  then  performed,  one  will  not  be  conscious  of  them. 

However,  if  one  attempts  to  enhance  vision  at  a  location  by  focusing 
attention  on  it,  then  this  buildup  of  attention  effects  will  De  found  to 
be  effortful . 

Components  of  Attention 

Implicit  in  the  foregoing  discussion  is  a  strong  separation  ue tween 
(a)  the  changing  of  the  locus  of  visual  processing,  whether  it  be  for  the 
purpose  of  attending  to  a  location  or  performing  some  other  computation, 
and  (b)  the  enhancement  of  vision  when  attention  has  been  focused  on  a 
location. 

This  distinction  has  been  discussed  at  some  length  by  Posner,  Walxer, 
Friedrich,  and  Rafal  (19«4).  They  proposed  that  there  are  actually  three 
components  to  a  change  in  the  focus  of  internal  attention:  First,  atten¬ 
tion  must  be  disengaged  from  its  current  focus;  then  the  focus  of  attention 
is  moved  across  visual  space;  and,  finally,  the  effects  of  attention  ouild 
up  at  the  target  location.  These  operations  are  called,  respectively,  the 
"disengage,"  "move,"  and  "engage"  operations.  Posner  et  al.,  argued  that, 
based  on  performance  experiments  using  individuals  with  specific  brain 
damage,  there  is  good  evidence  that  these  operations  are  localized  in 
different  parts  of  the  brain,  and  each  operation  takes  a  measurable  amount 
of  time. 

The  model  derived  from  the  results  of  the  present  experiments  allows 
one  to  estimate  separately  the  temporal  characteristics  of  processes 
responsible  for  shifting  attention  to  the  target  location  (perhaps  the 
disengage  or  move  operations)  and  processes  that  result  in  improved 
performance  once  attention  has  arrived  there  (the  engage  operation). 

In  determining  how  quickly  attention  effects  begin  at  a  new  location, 
it  seemed  unnecessary  to  adopt  a  position  concerning  the  nature  of  the 
attention  mechanism.  Hone  of  the  present  findings  appears  to  contradict 
either  a  moving  spotlight  (Shulman  et  al . ,  1979),  zoom  lens  (Erixsen  ft 
Yeh,  19d5),  or  gate  opening  (Reeves  &  Sperling,  in  press)  view  of  attention. 

The  present  results  do,  however,  shed  some  light  on  what  Happens  unco 
attention  reaches  a  location.  Attention  seems  to  increase  the  rate  at 


which  information  from  the  target  location  accumulates,  as  reflected  in  an 
increased  proportion  of  correct  discriminations.  This  rate  increase  is 
not  simply  a  reflection  of  better  overall  performance  with  increasing 
attention.  The  data  show  that  when  sufficient  time  was  allowed  for 
shifting  attention,  most  of  the  useful  target  information  was  extracted 
between  34  and  50  msec,  and  performance  increased  little  between  50  and  84 
msec.  However,  when  less  shifting  time  was  available,  information  was 
still  accumulating  at  the  longer  target  durations. 

Conclusion 

These  experiments  give  evidence  that  internal  attention  shifts  are 
fast  enough  to  help  construct  the  visual  percept.  Since  attention  can 
apparently  shift  within  a  few  milliseconds,  it  could  possibly  be  used  in 
conjoining  features  of  objects,  localizing  objects  relative  to  one 
another,  and  perhaps  other  fundamental  aspects  of  perception. 

The  fact  that  it  took  roughly  120  msec  for  the  effects  of  attention 
to  asymptote  in  the  present  experiments,  and  even  longer  in  some  other 
experiments,  does  not  necessarily  mean  that  120  msec  is  the  time  required 
to  change  the  location  being  attended.  Rather,  it  is  likely  that  most  of 
this  time  represents  an  accumulation  of  the  attentional  effects  necessary 
to  discriminate  very  similar  and  briefly  presented  peripheral  targets. 

The  easier  the  discrimination  that  is  required,  toe  faster  a  given  level 
of  performance  could  be  reached. 

In  a  real  scene  composed  of  relatively  distinct  objects,  the  amount 
of  attention  required  to  conjoin  the  color  and  form  of  large  objects  might 
be  minimal.  Thus,  many  rapid  shifts  could  be  executed  in  the  short  time 
that  it  takes  to  perceive  the  main  elements  of  such  a  scene. 
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